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Figure 1 (SEQ ID NO 1V Amino acid sequence of the A. chrysogenum protein encoded by a 
nucleic acid molecule of the invention (depicted from the N terminus to the C terminus) 



MASPIASAAL KARIRRPSML KKLCKPQDLM 
VEQNGLQGKL KYSLFVGASS GAETENRWAS 
HLSMFPVDLV YGYYTKDRPH NKLDWWEA 
EVNTSLPSFE GLHDITMTDL PPLRKPYLVM 
APNAEADESA NKIAGHLIEF FEHEVAHGRL 
KVWTEVIQDT FLDLFDSGRL DFATATSIRF 
EIIRRLGVIA MNTPVEVDIY AHANSTCVMG 
KTDPHGVSCI VPMCTHIDQT EHDLDVIVTE 
LTHYFEKAES DCLRKGWGHE PHLLFNSFDL 



HHFPNGSYIG WSGFTGVGYP KKMPTYMADH 
LDMIDRRTPH QVGKAISKGI NEGKIHFFDK 
TDIKEDGSIV PGASVGATPE LIQMADKIII 
GVEDRIGRTS IPIDPEKWG ILESDYQDAT 
PNSLLPLQSG IGNVANAIIG GLDNSNFRNL 
SPDGFRRFYD NWEAYYGKLL LRSQQVSNSP 
SRMLNGLGGS ADFLRSSKYS IMHTPSTRPS 
QGLADVRGLS PRERARVIIK KCAHPVYQPI 
HKALVEHGSM QKVGQW 
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Figure 2 (SEQ ID NO 2): Genomic DNA sequence of the coding region of a genomic clone 
of the new A. chrysogenum gene from the translation start codon (ATG) to the last coding 
codon (TGG). The introns are depicted underlined. A single strand in 5'-to-3' orientation is 
shown. 

ATGGCATCAC CAATAGCCTC TGCCGCCCTC AAGGCGCGCA TTCGCCGCCC CTCGATGCTC 
AAGAAGCTGT GCAAGCCCCA GGATTTGATG CATCACTTCC CCAATGGCTC GTACATTGGC 
TGGTCCGGCT TCACCGGCGT CGGCTACCCG A AGTGAGTTC CACCGTCATC CCGCTCCACA 
GTAGGCGCAG CCGGCCCGCT GACAGTCCCC GACAG GAAAA TGCCGACCTA CATGGCCGAT 
CACGTCGAGC AGAACGGCCT TCAGGGCAAG CTGAAGTACT CGCTATTCGT GGGCGCATCG 
TCGGGTGCTG AGACAGAGAA TCGCTGGGCG TCGCTCGACA T GAT T GAT AG GAGGACCCCG 
CATCAGGTCG GCAAGGCCAT CTCCAAGGGC ATCAATGAGG GCAAGATCCA CTTCTTCGAC 
AAGCATCTCT CCATGTTCCC CGTGGACCTT GTATAC GTAC GTCAACGATG ATCCCTTGGA 
ATGTGCATGT ACTACGAGTA CCTGGCGCTA ACATCCGGTC AG GGCTACTA CACAAAGGAT 
AGACCCCACA AC AAGCT GGA CGTGGTGGTG GTGGAGGCCA CCGACATCAA AGAGGACGGA 
AGCATTGTAC CCGGAGCTTC AGTCGGCGCG ACCCCCGAGC TCATCCAGAT GGC CGATAAG 
GTGAGCAATT TCGATTTCTA GCGGAGGGCG CAGCAGGACC TGACATCTCC CTGTGCAGA T 
CATTATCGAG GTCAACACCT CACTGCCTTC ATTCGAGGGT CTCCACGACA TCACCATGAC 
CGACCTGCCC CCGCTACGGA AGCCCTATCT CGTCATGGGT GTCGAGGACC GCATCGGCAG 
GACCTCTATC CCTATCGACC CCGAGAAGGT TGTAGGCATC CTCGAATCCG ACTACCAGGA 
CGCCACTGCC CCCAACGCCG AGGCCGACGA GAGTGCGAAC AAGATTGCTG GCCACTTGAT 
TGAGTTCTTC GAGCACGAGG TCGCCCACGG CCGTCTCCCG AACTCCCTCC TTCCCCTCCA 
GTCCGGCATC GGCAACGTCG CCAACGCCAT CATCGGTGGC CTCGACAACT CCAACTTCCG 
CAACCTCAAG GTCTGGACTG AGGTTATCCA GGACACCTTC CTCGACCTCT TCGACTCGGG 
CCGCCTCGAC TTTGCCACGG CCACCTCTAT CCGCTTCTCC CCCGACGGTT TCCGCCGGTT 
CTACGACAAC TGGGAGGCCT ACTACGGCAA GCTCCTCCTC CGCAGCCAGC AGGTGTCCAA 
CTCGCCCGAG ATCATCCGCC GCCTTGGTGT CATTGCCATG AACACCCCCG TCGAGGTCGA 
CATCTACGCC CACGCCAACT CCACCTGCGT CATGGGCTCG CGCATGCTCA ACGGCCTGGG 
CGGCTCCGCC GACTTCCTGC GCTCCTCCAA GTACTCTATC ATGCACACCC CGTCCACCCG 
CCCCTCCAAG ACCGACCCGC ACGGCGTCTC GTGCATCGTT CCCATGTGCA CCCACATCGA 
CCAGACTGAG CACGACCTCG ACGTCATCGT CACCGAGCAG GGCCTGGCCG ACGTGCGCGG 
CCTGAGCCCC AGGGAGAGGG CCCGCGTCAT CAT C AAG AAG TGCGCCCACC CGGTCTACCA 
GCCCATCCTG ACCCACTACT TTGAGAAGGC CGAGAGCGAC TGCCTACGCA AGGGCTGGGG 
CCACGAGCCC CATCTGCTCT TCAACTCGTT TGACCTGCAC AAGGCCCTCG TGGAGCACGG 
AAG CAT GC AG AAGGTCGGGC AGTGG 
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Figure 3 (SEQ ID NO 3): cDNA sequence of the coding region of the new A. chrysogenum 
gene from the translation start codon (ATG) to the last coding codon (TGG). A single strand 
in 5 -to-3' orientation is shown. 



ATGGCATCAC CAATAGCCTC TGCCGCCCTC 
AAGAAGCTGT GCAAGCCCCA GGATTTGATG 
TGGTCCGGCT TCACCGGCGT CGGCTACCCG 
GTCGAGCAGA ACGGCCTTCA GGGCAAGCTG 
GGTGCTGAGA CAGAGAATCG CTGGGCGTCG 
CAGGTCGGCA AGGCCATCTC CAAGGGCATC 
CATCTCTCCA TGTTCCCCGT GGACCTTGTA 
AACAAGCTGG ACGTGGTGGT GGTGGAGGCC 
CCCGGAGCTT CAGTCGGCGC GACCCCCGAG 
GAGGTCAACA CCTCACTGCC TTCATTCGAG 
CCCCCGCTAC GGAAGCCCTA TCTCGTCATG 
ATCCCTATCG ACCCCGAGAA GGTTGTAGGC 
GCCCCCAACG CCGAGGCCGA CGAGAGTGCG 
TTCGAGCACG AGGTCGCCCA CGGCCGTCTC 
ATCGGCAACG TCGCCAACGC CAT CATC GGT 
AAGGTCTGGA CTGAGGTTAT CCAGGACACC 
GACTTTGCCA CGGCCACCTC TATCCGCTTC 
AACTGGGAGG CCTACTACGG CAAGCTCCTC 
GAGATCATCC GCCGCCTTGG TGTCATTGCC 
GCCCACGCCA ACTCCACCTG CGTCATGGGC 
GCCGACTTCC TGCGCTCCTC CAAGTACTCT 
AAGACCGACC CGCACGGCGT CTCGTGCATC 
GAGCACGACC TCGACGTCAT CGTCACCGAG 
CCCAGGGAGA GGGCCCGCGT CATCATCAAG 
CTGACCCACT ACTTTGAGAA GGCCGAGAGC 
CCCCATCTGC TCTTCAACTC GTTTGACCTG 
CAGAAGGTCG GGCAGTGG 



AAGGCGCGCA TTCGCCGCCC CTCGATGCTC 
CATCACTTCC CCAATGGCTC GTACATTGGC 
AAGAAAATGC CGACCTACAT GGCCGATCAC 
AAGTACTCGC TATTCGTGGG CGCATCGTCG 
CTCGACATGA TTGATAGGAG GACCCCGCAT 
AATGAGGGCA AGATCCACTT CTTCGACAAG 
TACGGCTACT ACACAAAGGA TAGACCCCAC 
AC CGACAT CA AAGAGGACGG AAGCATTGTA 
CTCATCCAGA TGGCCGATAA GAT CAT T AT C 
GGTCTCCACG ACATCACCAT GACCGACCTG 
GGTGTCGAGG ACCGCATCGG CAGGACCTCT 
ATCCTCGAAT CCGACTACCA GGACGCCACT 
AACAAGATTG CTGGCCACTT GATTGAGTTC 
CCGAACTCCC TCCTTCCCCT CCAGTCCGGC 
GGCCTCGACA ACTCCAACTT CCGCAACCTC 
TTCCTCGACC TCTTCGACTC GGGCCGCCTC 
TCCCCCGACG GTTTCCGCCG GTTCTACGAC 
CTCCGCAGCC AGCAGGTGTC CAACTCGCCC 
ATGAACACCC CCGTCGAGGT CGACAT CT AC 
TCGCGCATGC TCAACGGCCT GGGCGGCTCC 
AT CAT GC AC A CCCCGTCCAC CCGCCCCTCC 
GTTCCCATGT GCACCCACAT CGACCAGACT 
CAGGGCCTGG CCGACGTGCG CGGCCTGAGC 
AAGTGCGCCC ACCCGGTCTA CCAGCCCATC 
GACTGCCTAC GCAAGGGCTG GGGCCACGAG 
CACAAGGCCC TCGTGGAGCA CGGAAGCATG 



WO 2004/026902 



4/18 



PCT/EP2003/0 10289 



Figure 4 (SEQ ID NO 4): Genomic DNA sequence of a BamHI/EcoRI fragment of a genomic 
clone of the new A. chrysogenum gene (a single strand in 5'-to-3' orientation is shown). The 
translation start codon (ATG) and the translation stop codon (TAA) of the coding region are 
depicted underlined and in bold type; the introns are depicted underlined. 



GAAG ATCGCATTTG GGCGAGTGGG CTAATAATGC CTGCTGCCTG CCTGTGGACG 
GTAAATGAAT TAGGTGGAAT GTGTCGCAAA TTGAGGGGAA TGGCCCCCTT AT CAT AT AAA 
GTGCCAATGC GATACTATGG CGTGGCGTGG GGTCGCGTCG GTGTCCGGCC GGTCGAACGG 
AGGTCCCGGC TATCAATAGG CGGTAGGCCG GCATTGAATC GGTTCCACCG TATTCCAGAC 
ACCCAAGGAA GGCCCGCCAC CCCCAGCTCC GGCCTGGGGA TAGCGCCGAG TGGAGCACTC 
ACGGGGGCCG TGTTTGACTC GAAGACGCGT CGTGATTGGC CAGAACTTCA TCCCCCTCTG 
CCAAGTATTG GTTCACGGGA TTCGGCGACG TCAACGACCC CACCGGCCCG GAT T AC AT AA 
GGTGCACTGC AGCTACTACG TAGTACTCGT ACTTGGGAAG GAGGGACCCT TGGGGTCGGA 
GGTTTTAAAG GCAATGGCTT CTTCGCTGGT CCACCCAACC TGACTCTCAC TCTCCCTTTT 
ACCTCGCTCC TCTGATTATT CCCTCGTCTG CGTCTGGATT TCATCTCTTT CCCCTCCCGG 
CCCCTTTGGA TCTCTGCTCT CCCCTCCTCT CTCCCCCGCA TTGGTGTGTA AAACCACTGT 
CCCGCGGCCT CGCGACGAGT GACGTACTGC AAGCCGAAAC CTCACAATCC CTTCCTCACA 
ATGG CAT C AC CAATAGCCTC TGCCGCCCTC AAGGCGCGCA TTCGCCGCCC CTCGATGCTC 
AAGAAGCTGT GCAAGCCCCA GGATTTGATG CATCACTTCC CCAATGGCTC GTACATTGGC 
TGGTCCGGCT TCACCGGCGT CGGCTACCCG A AGTGAGTTC CACCGTCATC CCGCTCCACA 
GTAGGCGCAG CCGGCCCGCT GACAGTCCCC GACAG GAAAA TGCCGACCTA CATGGCCGAT 
CACGTCGAGC AGAACGGCCT TCAGGGCAAG CTGAAGTACT CGCTATTCGT GGGCGCATCG 
TCGGGTGCTG AGACAGAGAA TCGCTGGGCG TCGCTCGACA T GAT T GAT AG GAGGACCCCG 
CATCAGGTCG GCAAGGCCAT CTCCAAGGGC ATCAATGAGG GCAAGATCCA CTTCTTCGAC 
AAGCATCTCT CCATGTTCCC CGTGGACCTT GTATAC GTAC GTCAACGATG ATCCCTTGGA 
ATGTGCATGT ACTACGAGTA CCTGGCGCTA ACATCCGGTC AG GGCTACTA CACAAAGGAT 
AGACCCCACA ACAAGCTGGA CGTGGTGGTG GTGGAGGCCA C C G AC AT C AA AGAGGAC GGA 
AGCATTGTAC CCGGAGCTTC AGTCGGCGCG ACCCCCGAGC TCATCCAGAT GGCCGATAAG 
GTGAGCAATT TCGATTTCTA GCGGAGGGCG CAGCAGGACC TGACATCTCC CTGTGCAGA T 
CATTATCGAG GTCAACACCT CACTGCCTTC ATTCGAGGGT CTCCACGACA TCACCATGAC 
CGACCTGCCC CCGCTACGGA AGCCCTATCT CGTCATGGGT GTCGAGGACC GCATCGGCAG 
GACCTCTATC CCTATCGACC CCGAGAAGGT TGTAGGCATC CTCGAATCCG ACTACCAGGA 
CGCCACTGCC CCCAACGCCG AGGCCGACGA GAGTGCGAAC AAGATTGCTG GCCACTTGAT 
TGAGTTCTTC GAGCACGAGG TCGCCCACGG CCGTCTCCCG AACTCCCTCC TTCCCCTCCA 
GTCCGGCATC GGCAACGTCG CCAACGCCAT CATCGGTGGC CTCGACAACT CCAACTTCCG 
CAACCTCAAG GTCTGGACTG AGGTTATCCA GGACACCTTC CTCGACCTCT TCGACTCGGG 
CCGCCTCGAC TTTGCCACGG CCACCTCTAT CCGCTTCTCC CCCGACGGTT TCCGCCGGTT 
CTACGACAAC TGGGAGGCCT ACTACGGCAA GCTCCTCCTC CGCAGCCAGC AGGTGTCCAA 
CTCGCCCGAG ATCATCCGCC GCCTTGGTGT CATTGCCATG AACACCCCCG TCGAGGTCGA 
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CATCTACGCC CACGCCAACT CCACCTGCGT 
CGGCTCCGCC GACTTCCTGC GCTCCTCCAA 
CCCCTCCAAG ACCGACCCGC ACGGCGTCTC 
CCAGACTGAG CACGACCTCG ACGTCATCGT 
CCTGAGCCCC AGGGAGAGGG CCCGCGTCAT 
GCCCATCCTG ACCCACTACT TTGAGAAGGC 
CCACGAGCCC CATCTGCTCT TCAACTCGTT 
AAG CAT GC AG AAGGTCGGGC AGTGG TAAG A 
AGTTGGAACT AGAATCAGAT ATACAGCCTT 



CATGGGCTCG CGCATGCTCA ACGGCCTGGG 
GTACTCTATC ATGCACACCC CGTCCACCCG 
GTGCATCGTT CCCATGTGCA CCCACATCGA 
CACCGAGCAG GGCCTGGCCG ACGTGCGCGG 
CATCAAGAAG TGCGCCCACC CGGTCTACCA 
CGAGAGCGAC TGCCTACGCA AGGGCTGGGG 
TGACCTGCAC AAGGCCCTCG TGGAGCACGG 
TTGGCGAGAC GGGAGAGGCG TTGTTGTAGG 
TCATATATGT AGATAATGGA GCCATT 
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Figure 5 (SEQ ID NO 5): A. chrysogenum genomic DNA sequence of an approx. 16 kb 
region marked by SnaBI and Bfrl and containing the biosynthesis genes pcbC (position 1366 
to position 350, inverse arrangement) and pcbAB (position 2598 to position 13517). A single 
strand in 5'-to-3' orientation is shown. The particular translation start codons (ATG, GTG) 
and the particular translation stop codons (TAA, TGA) of the respective coding regions are 
depicted underlined and in bold type. The said cleavage sites are depicted underlined. 



1 


TACGTACATA 


CGTCGCGGGG 


GGTAGACAAT 


GGTGTGGTGT 


ACGTGTACAA 


CTACAGTCAG 


61 


ACATGGACGC 


AGGAAACGCA 


T CAT T GAT AC 


ATGCACACGG 


GGCAGCAAAT 


TTAGCCTGTT 


121 


TCACTACATG 


TACATAGAGG 


GTACACTCCA 


GAGCATACTG 


ATGGGAGAAA 


AAGGGTTCGA 


181 


TTGCTGGTGG 


TT TAA CAT AG 


CCGGCAAGGG 


GAAAAAAAAA 


AGGGGGCGGA 


GAAGGACTGA 


241 


TTCTTCCTGG 


C AG AC AC T C G 


ACCCTTCCGG 


CCCCTTGAAC 


TGCTTTTACT 


CCCGCATTCC 


301 


TCCGCACGCC 


CGCCCACAGC 


GGCAGATCAG 


CCGAACCTGA 


TCGACCGATT 


TAGGTCTGAC 


361 


CATTCTTGTT 


GATCAAGCCC 


CGCAGTCCCC 


CCTGCAGATA 


CTCTCCGTAG 


GAGATGGCCG 


421 


GCTTGTCCTT 


GGCGGCATCC 


TTGGCCCCAT 


CCTTGGCGGT 


CGCGGGGTCC 


CACGGCTGGA 


481 


TGGTGTCCTC 


CCAGCCCAGG 


TTGACGAAGA 


AGGGCAGTGA 


CTGGCGCTCC 


TCGTTGACCC 


541 


ATTTGACGCG 


GTGGATCGGG 


GCCGGGTAGT 


AGTCGTCGGT 


GATATGGGCC 


ATGTAGCTGC 


601 


CGCAGTTGAT 


GAGGAAGCCC 


GTGTCGTCAG 


CCTGGATGTC 


CTGCCAGCCC 


TGCGGGGTCT 


661 


TGACCTGCAG 


ATTCTGCACG 


TCGGACTGGT 


ACAACACCGT 


GAT GAG GGAC 


ACGTCCTCGT 


721 


GCCACTCGAA 


GCTGAGCTTG 


GTGCCGTCGT 


CGGCCGTCTT 


GATGGCCGGC 


TCCGGGTACG 


781 


GGTCGAGGTA 


CGGGTAACGG 


ATGAGCACGA 


CCGACGAGAG 


CGTCGTGTCA 


CGGCGGGAGT 


841 


GGCGGGTGAA 


GAAGTCCTCG 


TCGCGACCTA 


GGGCGAGAGC 


GTAGCCGCGC 


AGCACCGCGG 


901 


AGGAGAGGCC 


GAAGACGTCC 


CAGTAGTACT 


TCTCGGCGAA 


GGCCCGGAAC 


CCCGGGTGCT 


961 


TCGCCTCGTC 


CGGCCAGACG 


TTGACCTCGT 


GCATAGGGGT 


GGGCTCCTTG 


ATTCGCGGGT 


1021 


GGTCTGGGCT 


GAAGGAGGGG 


TTCAGGTAGC 


AGAACGATTC 


GACCGCCTTC 


TTGCCCGGGA 


1081 


TCGGCAGGTA 


GTAGCCCGCC 


CGGATCTGGG 


ACTCGTGCTC 


CTTGTTGTAG 


GCCCGGATGG 


1141 


CGAGCTGCCA 


CTTCTCCTCG 


TCCGTGATGC 


TCATGTGGAA 


TTTGTTCGTC 


TCGCGCGAGA 


1201 


GCCACGGCAG 


GTCGACACCG 


TGGTTCACCG 


CGTAAAAGAA 


GCCTGTGTCG 


CGCGATGCGG 


1261 


CGTCGATGGC 


GCGAGCTACC 


TCGAGCTTCT 


TCTCCTTGTC 


ATCGCCGAAT 


AGGGGCGAGA 


1321 


CATCGATTCG 


GGGGACGTTG 


GCCACTGGAA 


CTGGAACGGA 


ACCCATGGTG 


ACGGTTTGTC 


1381 


CTGCCTGGTG 


TAAGATGTGA 


AAGACGAGAT 


ATGCGTGAGT 


GACGATGGCG 


GAAGGAGAAG 


1441 


CCTCGAAAAT 


CAGAAGAGCG 


ACCAAAGGGA 


TATTCAAGTA 


TTCGCCCCTC 


TTGAAGCTGT 


1501 


TTATACGGGC 


GGCTGGGTGT 


GTGTATGTGT 


ACTTGAGTAC 


CTACCTCGTG 


TCTCCCGTTG 


i rr <r i 

J- U Kj J- 


C TAT AC GAT A 


TGAGCTTCCC 


CACGACGCGC 


CTTTATGGCC 


TGACCAAGGT 


CTCGATTATC 


1621 


CGGCTCCTGC 


GGGTGACACT 


GCCGAGGGGG 


GTTACATACG 


GTCCAGCAGC 


GGCGATGGAG 


1681 


TTTGGTCCCT 


GAAGACTGCA 


TGGCGGGGCC 


AAGCGATGAG 


GAACGCCGTT 


ACATGCATGT 


1741 


GCATGTAGAC 


GCCGCCACCC 


ATGAGGCCCG 


GAACAGTCTA 


TCGAAGCTCA 


GGGATTGGCC 


1601 


CGGCAACTCG 


ACGCCCCGTC 


GAGCGGCTCA 


CCGGTAGTCG 


ACGGCGTCCG 


TCGGAATCTC 


12 ~ 1 


GCGCTGCTGC 


GGGCCACCAC 


GGCGATGGGC 


CGTACACACT 


GCTACTACGG 


TGTACAATGT 


1921 


ATCATGTACC 


CGATCGACGA 


GGAACTCGGG 


GTAGAGGTAC 


CCCGTACAAT 


CCAGTTTCTC 



WO 2004/026902 



7/18 



PCT7EP2003/010289 



1981 


AACCCAATGG 


AACCACACAT 


ACGGGGTGGC 


TTTGGTTCAC 


GTTGCACTTT 


AAACTCGCAG 


2041 


ACGAGGGACC 


GACCTGCAGC 


GTGGCCCACT 


TCTGAAGCCT 


GCCCAGCTTT 


CTGCAAGACG 


2101 


CGGGCCATCG 


CGCTTGGCCG 


AGGAGAGAAA 


GGGTATCCAT 


GGCGACAAAG 


GCGGTCCTGG 


2161 


TGGGTTCGGT 


GCCGGCTTTG 


GAGTTCACTG 


GTCTGGGTGG 


GTGGCCAGCT 


GGATGCATGC 


2221 


ATTGGCCTGT 


ATCAAAGGTC 


CGGGATTCCC 


CAGGAGTATA 


AGACGTTCGT 


GCTGGGAGAT 


2281 


CTAGCGACGT 


GTTGGGAAAT 


ATCGGCCGTA 


GAGTGCGAAA 


AAGAACTGGC 


GGAAATATTT 


2341 


CTCCTTGGAC 


TCGGTCACAC 


TCAGTCAGTA 


GTGGACTGCC 


AGTCTATCAT 


ACACCTTTGA 


2401 


TATCAACATG 


ACTATCCTTA 


CAGGTGCCGA 


CGACGCCTCG 


TCATACCACA 


GGTATGTCTT 


2461 


CACAGCCTCT 


GGAAAGCGCA 


GTTGGGAGCT 


ATCTCTAACA 


TTACCACATC 


AGGCGCAATG 


2521 


GAAGCTCTGA 


TATCCCAAAA 


GGTGCCATCC 


ACCGCAACGG 


CTTCGCAGCC 


GCAGCCCCTG 


2581 


ACTGCTGGAT 


CCGGTCCGTG 


CTGTTTTCCG 


TGCACCAGAT 


GCTCAAGAGG 


TTCGGAAACG 


2641 


GATCTCACAC 


CGTCGTGGCG 


TCACTCGTAA 


CTTCATCAGA 


GGGATGCCCT 


TCAACTTCGG 


2701 


CCTGGAGGGC 


CATCCCCTCC 


GTCATCCATC 


ATATAGAGGG 


CGGAGACAAC 


AACAACACAG 


2761 


TCGCCTCTGC 


CGTGGAACAG 


GCGGCGAATC 


TCCTGAACTC 


AGAAGGATCG 


GGACAGGACC 


2821 


TTCTGATTCC 


CATCGGACTC 


ACTGAGCTCG 


TCAAGTCGGA 


GCTGATTGAC 


CTCCTGGTCA 


2881 


TCTTCGACGA 


CGAGACAAAT 


AAC AT AC G AC 


TGCCGCAGGA 


CTTCCCACTT 


ATCCTGCGGA 


2941 


TACATCAGCG 


GCAAGACCAC 


TGGCAGCTGT 


CAGTCCGGTA 


TCCCTCGCCC 


CTTTTCGACA 


3001 


CCATGGTCAT 


CGACAGCTTT 


CTGAGCGCAC 


TTCACAACCT 


GTTGTCCGCG 


GTGACAAAAC 


3061 


CCTCCCAGCT 


CGTGCGCGAC 


ATCGAGCTGC 


TCCCAGAATA 


CCAGGTCGCT 


CAGCTGGAGA 


3121 


AGTGGAACAA 


CACAGACGGC 


GACTACCCCA 


CCGAGAAGCG 


GCTACATCAT 


CTGTTCGAGG 


3181 


AGGCAGCAGT 


GCGTCGTCCC 


CAACACGTTG 


CCCTCATCTG 


CGGCGACAAG 


CGCATCACCT 


3241 


ATGAGGAGTT 


GAATGCTATG 


GCGAATCGCC 


TGGCCCACCA 


TCTGGTATCC 


TCGGGTATCC 


3301 


AGACTGAGCA 


GCTCGTCGGT 


CTCTTCCTCG 


ACAAGACCGA 


GCTCATGATC 


GCTACTATTC 


3361 


TGGGCATCTG 


GAAATCTGGT 


GCCGCGCATG 


TACCTATCGA 


CCCTGGGTAC 


CCGGACGAGC 


3421 


GTGTCAAGTT 


CGTCCTGAAT 


GATACGAAGG 


CGCAAGTGGT 


CATTGCTAGT 


CAGAGGCACG 


3481 


TCGATCGACT 


GCGGGCTGAG 


GCTGTTGGCG 


GCCAGCATCT 


TCGCATCATC 


GGTCTCGAAT 


3541 


CTCTGTTCGA 


CAACCTTGCT 


CAACAGACAC 


AACACTCACC 


AGAGACGTCG 


GGCAATTTGA 


3601 


CCCATCTGCC 


CCTGAACAGC 


AAACAGCTTG 


CGTACGTGAC 


ATACACCTCG 


GGCACCACGG 


3661 


GCTTCCCGAA 


AGGCATCTAC 


AAGGAGCACA 


CAAGCGTCGT 


T AAC AG CATC 


ACCGATCTGT 


3721 


CTGCTCGGTA 


CGGTGTGGCC 


GGGGAGGACG 


ACGAGGTGAT 


ACTCGTCTTC 


TCCGCCTACG 


3781 


TCTTCGAGCC 


ATTCGTGCGC 


CAGATGCTCA 


TGGCCCTGAC 


CACGGGCAAC 


TCTCTCGCCA 


3841 


TCATCAGCGA 


CGAGG AC AAG 


TTCGACCCTG 


ACACCCTTAT 


TCCCTTCATC 


CAAAAACACA 


3901 


AAGTCACTTA 


CATCCACGCC 


ACCTCGTCAG 


TGTTGCAGGA 


GTACGACTTC 


GGGTCCTGCC 


3961 


CCTCGTTGAA 


ACGCATGATT 


CTGGTGGGAG 


AGAACTTGAC 


AGAGCCGCGC 


TACGAGGCCC 


4021 


TGAGGCAGCG 


CTTCAAGTCG 


CGCATCCTGA 


ATGAATATGG 


CTTCACCGAG 


TCTGCGTTTG 


4081 


TGACGGCGCT 


C AAC AT AT T C 


GAGCCTACCT 


CACAGAGGAA 


GG AC AT GAGT 


CTGGGAAGGC 


4141 


CGGTGCGCAA 


CGTCAAGTGC 


TATATCTTGG 


ATGCCAACCT 


CAAGAGAGTC 


CCCATCGGTG 


4201 


TTACAGGGGA 


GCTGCACATC 


GGTGGCTTGG 


GTATATCCCG 


GGGGTACATG 


AATAGGGAGG 


4261 


AGCTCACAAG 


GCAGAAGTTC 


CTCCCGAACC 


CCTACCAGAC 


CGATAAGGAG 


CGCCAACGGG 


4321 


GTGTCAACTC 


AACCATGTAC 


AAGACAGGAG 


ATCTGGCCCG 


CTGGCTACCC 


AGTGGCGAAG 


4381 


TCGAGTATCT 


CGGCCGTGCC 


GACTTCCAGA 


TCAAGCTGCG 


CGGCATTCGA 


ATTGAGCCCG 
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4441 


GCGAGATCGA 


GTCCACTCTC 


GCCATGTATC 


CCGGAATCAG 


GGCCAGCATC 


GTCGTGTCAA 


4501 


AGAAGCTTCT 


CAGTCAGGGG 


CAGGAGACGA 


TCCAAGACCA 


CCTTGTGGGG 


TACTATGTTT 


4561 


GCGATGAGGG 


CCACATCCCC 


GAGGGTGACC 


TGCTGAGCTT 


CCTGGAGAAG 


AAGCTACCTC 


4621 


GGTACATGGT 


CCCGACGCGC 


CTTGTCCAAC 


TGGCTCAGAT 


TCCAACCAAT 


ATCAACGGCA 


4681 


AGGCGGATCT 


GCGTGCTCTT 


CCTGCCGTCG 


AAGTCGCCGT 


AGCTCCCACC 


CACAAGCAGG 


4741 


ATGGCGAGCG 


AGGAAACCAG 


CTGGAGAGCG 


ACCTGGCTGC 


CATATGGGGC 


AACATTTTGA 


4801 


GTGTTCCCGC 


TCAAGACATT 


GGGTCTGAAT 


CCAACTTCTT 


CCGCCTGGGT 


GGCCACAGTA 


4861 


TTGCATGCAT 


CCAGCTCATT 


GCTCGTGTGC 


GACAGCAGCT 


AGGCCAGGGG 


ATTACCCTCG 


4921 


AGGAGGTCTT 


CCAGACCAAG 


ACGTTGCGAG 


CTATGGCTGC 


CCTCTTGTCG 


GAAAAGTACA 


4981 


CGAAGGCGTC 


GAATGGGACG 


AACGGAGTGA 


CCAACGGCAC 


TGCTCACGTC 


AACGGCCACG 


5041 


CAGCGAACGG 


CCATGTCAGC 


GACAGCTACG 


TGGCCAGCAG 


TTTGCAGCAA 


GGCTTTGTTT 


5101 


ACCATTCACT 


CAAGAACGAA 


CTGTCCGAGG 


C GT AC AC CAT 


GCAATCCATG 


ATCCACTATG 


5161 


GTGTGCCCCT 


GAAACGGGAT 


ATTTACCAAG 


CGGCATGGCA 


GAGGGTACAG 


GGGGAGCACC 


5221 


CTGCACTGCG 


GCTTCGGTTC 


ACATGGGAGG 


CCGAAGTGAT 


GCAGATCGTG 


GACCCGAAAT 


5281 


CTGAACTCGA 


CTGGCGTGTT 


GTTGACTGGA 


CCGATGTTTC 


GAGCCGGGAG 


AAGCAGCTGG 


5341 


TTGCGCTGGA 


GCAACTCCAA 


ACGGAGGACC 


TTGCTAAGGT 


CTACCATCTC 


GATAAGGGGC 


5401 


CCCTTATGCG 


ACTATACCTC 


ATCCTGCTTC 


CGGACTCAAA 


GTACTCCTGT 


CTGTTCAGCT 


5461 


GCCACCATGC 


CATTCTCGAT 


GGGTGGAGTC 


TGCCCCTGCT 


CTTCAACAAT 


GTCCACCAGG 


5521 


CCTACCTCGA 


TCTCGTCGAA 


GGCACTGCTT 


CGCCCGTCGA 


GCAGGACGCT 


ACCTACCTAC 


5581 


TCGGCCAGCA 


GTACCTGCAG 


AGCCACAGGG 


ACGACCATCT 


CGACTTCTGG 


GCCGAGCAGA 


5641 


TCGGCAGGAT 


CGAAGAGCGC 


TGCGACATGA 


ATGCGCTGCT 


GAATGAGGCC 


AGCCGATACA 


5701 


AGGTGCCCCT 


GGCCGACTAT 


GACCAAGTCC 


GCGAGCAGAG 


GCAGCAGACC 


ATCAGTCTGC 


5761 


CCTGGAACAA 


CTCCATGGAC 


GCTGGTGTGC 


GGGAAGAACT 


CTCCAGTCGT 


GGCATCACCC 


5821 


TTCATTCCAT 


TCTACAGACG 


GTCTGGCACC 


TGGTCCTCCA 


CTCTTATGGA 


GGAGGCACCC 


5881 


ACACGATCAC 


CGGCACCACC 


ATCTCCGGCC 


GTCACCTGCC 


CGTCCCCGGA 


ATTGAGCGCT 


5941 


CTGTTGGTCT 


CTTCATCAAC 


ACACTCCCTA 


TGATCTTTGA 


TCACACCGTC 


TGCCAGGATA 


6001 


TGACAGCGCT 


CGAGGCCATT 


GAGCATGTCC 


AAGGCCAAGT 


CAACGCCATG 


AACTCCCGGG 


6061 


GCAACGTCGA 


GCTCGGACGC 


ATGAGCAAGA 


ACGACCTCAA 


GCACGGGCTC 


TTCGACACCC 


6121 


TCTTCGTCCT 


CGAGAACTAC 


CCAAACCTCG 


ACACGGAGCA 


GCGGGAGAAG 


CACGAGGAGA 


6181 


AGCTCAAGTT 


CACCATCAAG 


GGTGGCACGG 


AGAAGCTCAG 


TTACCCGCTG 


GCCGTGATTG 


6241 


CCCAAGAGGA 


CGGCGACAGC 


GGATGCTCGT 


TTACGCTCTG 


CTATGCGGGC 


GAGCTCTTCA 


6301 


CGGATGAGTC 


CATCCAGGCG 


CTCCTGGACA 


CTGTCCGGGA 


CACCCTGAGT 


GATATTCTCG 


6361 


GGAACATCCA 


TGCCCCTATC 


CGCAACATGG 


AGTACCTCTC 


CTCGAACCAG 


ACGGCGCAGC 


6421 


TCGACAAGTG 


GAATGCCACC 


GCCTTCGAGT 


ACCCCAACAC 


CACACTGCAC 


GCCATGTTCG 


6481 


AGTCCGAGGC 


GCAGCAGAAG 


CCGGACAAGG 


TGGCCGTGGT 


GTACGAGGAT 


ATCAGGCTGA 


6541 


CCTACCGCGA 


GCTCAACAGC 


CGTGCCAATG 


CCCTGGCGTT 


CTACCTCCTC 


TCCCAGGCGG 


6601 


CTATCCAACC 


GAACAAGCTG 


GTCGGGCTGA 


TCATGGACAA 


GAGCGAGCAC 


AT GAT C AC G A 


6661 


GCATCCTCGC 


GGTCTGGAAA 


ACGGGTGGAG 


CCTACGTCCC 


GATCGACCCT 


CGATACCCTG 


6721 


ACCAGCGTAT 


CCAGTATATC 


CTGGAGGATA 


CGGCGGCTCT 


CGCAGTCATC 


ACGGACAGTC 


6781 


CTCATATTGA 


CCGTCTGCGC 


AGCATCACCA 


ACAACCGCCT 


TCCTGTTATC 


CAGTCGGACT 


6841 


TTGCTCTCCA 


ACTCCCGCCC 


AGCCCAGTTC 


ATCCCGTCTC 


AAACTGCAAG 


CCAAGCGACC 
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6901 TCGCCTACAT CATGTACACA TCCGGCACCA CTGGCAACCC CAAGGGTGTC ATGGTGTAGC 

6961 ACCACGGTGT AGTGAATCTG TGCGTTTCAC TCTGCCGGCT CTTCGGCCTT CGGAACACAG 

7021 ATGACGAGGT CATCCTCTCG TTCTCGAACT ACGTCTTCGA CCACTTTGTC GAGC AG AT G A 

7081 CGGATGCCCT TCTCAACGGT CAGACTCTTG TGGTCCTCAA CGACGAGATG CGTGGCGACA 

7141 AGGAGAGGCT TTACAGATAC ATCGAGACCA ACCGCGTCAC GTACCTCTCG GGGACACCTT 

7201 CCGTCATCTC CATGTACGAG TTCGACCGGT TCCGCGACCA CCTGCGGCGC GTGGATTGCG 

7261 TCGGCGAGGC CTTCAGCGAG CCGGTATTCG ACAAGATCCG CGAGACGTTC CCGGGTCTCA 

7321 TCATCAACGG TTATGGCCCG ACTGAGGTGT CTATCACTAC CCACAAGCGC CCCTACCCGT 

7381 TCCCGGAGCG CCGCACAGAC AAGAGCATCG GTTGCCAGCT GGACAACAGC ACGAGCTACG 

7441 TCCTCAACGA TGACATGAAG CGCGTGCCCA TCGGGGCCGT GGGAGAGCTG TACCTTGGTG 

7501 GCGATGGCGT CGCTCGCGGA TACCACAACC GGCCAGACCT GACGGCTGAC CGGTTCCCTG 

7561 CCAACCCCTT CCAGACGGAG C AG GAG AG AC TTGAGGGCCG AAATGCGCGT CTGTATAAGA 

7621 CTGGTGACTT GGTTCGCTGG ATCCACAATG CAAACGGCGA TGGTGAGATC GAGTACCTCG 

7681 GCCGCAACGA CTTCCAGGTC AAGATTCGAG GCCAGAGAAT CGAGCTGGGA GAGATCGAGG 

7741 CCGTGCTTTC ATCCTATCCG GGCATCAAAC AATCCGTCGT CCTGGCCAAG GACCGCAAGA 

7801 ATGACGGGCA GAAGTACCTC GTCGGCTACT TCGTCTCCTC AGCAGGGTCC CTGTCCGCCC 

7861 AGGCCATCCG CCGCTTCATG CTCACGAGCC TGCCCGATTA CATGGTTCCT GCGCAGCTGG 

7921 TGCCCATCGC CAAGTTCCCC GTCACCGTGA GCGGGAAGCT CGATGCCAAG GCCTTGCCCG 

7981 TGCCAGACGA TACAGTCGAG GATGACATTG TGCCACCGCG TACCGAGGTT GAGCGCATCC 

8041 TAGCTGGGAT CTGGTCTGAG CTGTTGGAGA TACCGGTCGA CAGGATCAGC ATCTACAGTG 

8101 ACTTCTTCAG TCTGGGCGGC GACAGTCTCA AGAGTACCAA GCTGTCCTTT GCTGCCACGC 

8161 GGGCTCTCGG TGTGGCCGTC AGTGTCCGCA ACTTGTTCAG CCATCCGACT ATCGAAGCCT 

8221 TGTCTCAGTG GATTATCAGG GGTTCGAACG AGGTCAAGGA TGTGGCTGTG GTGAAGGGCG 

8281 GTGCCAGTCT TGATATCCCC CTATCCCCTG CCCAGGAAAG ACTCATGTTC ATCCACGAGT 

8341 TCGGCCATAG CGGCGAGGAT ACTGGTGCTT ACAATGTGCC TTTGCAGCTG CAGCTTCACC 

8401 ATGATGTCTG TCTCGAGTCG CTTGAGAAGG CTCTGCGGGA TGTCGTCTCG AGACACGAGG 

84 61 CTCTCCGGAC CTTGATCACC AGGACCCAGA AGTCCTCCGT GCACTGCCAG AAGATCCTCG 

8521 AC GC C GAAGA AGCGCAAAAG CTCTTCTCTG TTGATGTTCT GCGCCTGACC TCGGAGACGG 

8581 AGATGCAGGG CAGGATGGCC GAGAGTACCG CCCACGCCTT CAAGCTCGAC GAGGAACTCC 

8641 CGATTCATGT ACGCCTGTAC CAGGTTGTAC GTGATGGCCG CACGCTCAGC TTTGCCAGCA 

8701 TCGTCTGCCA CCATCTGGCG TTTGACGCGT GGTCATGGGA TGTGTTCCAG AGGGACTTGG 

87 61 ACGCCTTCTA TGCCGTCCAT ACGAAGCACA AGGCTGCCGC CAACCTGCCA ACCCTCCGCG 

8821 TGCAATATAA GGAGTATGCG AT AGAGC AC C GCCGGGCTCT CCGCGCTGAG CAACACCGTG 

8881 TTCTCGCGGA CTACTGGCTG CGCAAGCTCA GTGACATGGA GGCGTCTTAT CTGGTCCCCG 

8941 ATCGCCCTCG ACCGGCGCAG TTTGACTATA CCGGGAACGA TCTCCAGTTC TCAACTACTC 

9001 CCGAGACCAG CGCGCAGTTG AAGGAGCTGG CCAAGCGCGA GGGTTCAAGC CTCTACACCG 

9061 TTGTGGCGGC GGCGTACTTT CTGCTTCTCT ACGTGTACAC CAACCAGCGG GATATCACGA 

9121 TTGGTATTCC CGTTGCGCAC CGTAACCATC CGGACTTTGA GTCGGTTGTC GGCTTCTTTG 

9181 TCAACTTGCT CCCTCTGCGG GTCAACGTGT CTCAGTCGGA CATTCATGGA CTTATCCAGG 

9241 CAGTGCAGAA AGAGCTTGTC GATGCCCAGA TCCATCAGGA CTTGCCATTC CAGGAGATCA 

9301 CCAAGCTTCT TCATGTGCAG CACGATCCAA GCCGCCATCC CCTTCTCCAG GCCGTGTTCA 
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9361 ACTGGGAAAA CGTACCCGCC AATGTCCACG AGGAGCAGCT GCTTCAGGAG TACAAGCCGC 

9421 CCTCGCCTCT GCCTTCGGCG GCCAAGTTTG ATCTCAACGT C AC GGT G AAA GAGAGCGTCA 

94 81 ATTCGCTCAA CGTCAACTTC AACTATCCTA CCAGCCTCTT CGAGGAGGAG ACCGTTCAGG 

9541 GGTTCATGGA AACCTTCCAT CTCCTTCTTC GACAACTGGC CCACAACAAG GCTAGCACAA 

9601 GCCTCTCGAA GCTGTCGGTT GAAGATGGAG TGTTGAATCC AGAGCCGACT AACCTTCAGC 

9661 CCTCAAGCCG GGACAGCGGA AATTCACTCC ATGGGCTCTT CGAGGACATC GTGGCCTCGA 

9721 CCCCGGACCG CATCGCAATT GCTGACGGCA CCAGGAGTCT CTCGTACTCC GAACTCAACG 

9781 AGCGGGCAAA CCAGCTGGTA CATTTGATCA TCTCTTCTGC CAGTATTGTA GCAGACGACC 

9841 GCATCGCTCT TCTTTTGGAC AAGAGCATCG ATATGGTGAT TGCTCTCCTG GCAGTTTGGA 

9901 AGGCCGGTGC CGCATATGTG CCCCTTGACC CGACATATCC GTCGCAGAGG ACTGAGCTCA 

9961 TCTTGGAGGA ATCTAGTGCC AGGACGCTCA TCACCACTAG AAAGCACACG CCGAGGGGAG 

10021 GAACAGTCGC AAATGTTCCA TCCGTGGTCC TTGACAGCCC CGAGACCCTA GCCTGCCTCA 

10081 ACCAGCAGTC AAAGGAAAAC CCGACAACGT CAACGCAGAA ACCGTCCGAC CTCGCATATG 

10141 TCATCTTCAC CTCGGGAACC ACAGGCAAGC CCAAGGGGGT TCTGGTGGAG CACCAGAGCG 

10201 TAGTCCAGCT GCGCAATTCC CT CATC GAG C GATACTTCGG CGAGACCAAC GGGTCTCACG 

10261 CCGTGCTCTT CCTGTCCAAC TACGTCTTCG ACTTCTCTCT TGAACAGCTC TGTCTCTCAG 

10321 TCTTGGGTGG AAACAAGCTC ATCATTCCAC CAGAGGAGGG TCTCACGCAC GAGGCATTCT 

10381 ACGACATCGG CCGCAGGGAG AAGCTATCCT ATCTCAGCGG GACGCCCTCG GTGCTGCAGC 

10441 AGATTGAGCT CTCCCGTCTG CCGCATCTTC ACATGGTCAC CGCTGCGGGC GAGGAGTTCC 

10501 ACGCTAGTCA GTTTGAGAAG ATGCGCTCCC AGTTCGCGGG C C AG AT C AAC AACGCCTATG 

10561 GTATCACTGA GACGACCGTG T AC AAC AT C A TCACCACGTT CAAGGGCGAT GCCCCCTTTA 

10621 CCAAGGCACT CTGCCACGGG ATCCCCGGAA GTCACGTCTA CGTCCTGAAC GACCGACTTC 

10681 AGCGTGTTCC TTTCAACGCT GTTGGCGAGC TCTACTTGGG CGGTGACTGC CTTGCTCGCG 

10741 GGTACCTCAA CCAGGATGCC CTGACCAACG AGCGATTCAT CCCCAACCCT TTCTACGAGC 

10801 CGAAACAGGC AAGTGACAGT CGTCCCCAGA GACTCTACAA GACTGGAGAT CTGGTGCGCT 

108 61 TCCGTGGACC CCACCATCTC GAGTATCTCG GCCGCAAGGA CCAGCAGGTC AAGCTGAGGG 

10921 GCTTCCGCAT CGAGCTCTCC GAGGTGCGGG ATGCCGTCCT AGCCATCTCT GCTGTTAAGG 

10981 AGGCTGCCGT CATCCCCAAG TATGACGAGG ATGGCTCCGA TTCACGAAGG GTCAGCGCCA 

11041 TCGTCTGCTA CTACACGCTC AACGCCGGAA CTGTGTGCGA AGCATCGAGT ATCCGTGACC 

11101 ACCTGCACGC CAACCTTCCC CCGTACATGG TCCCAAGTCA GATCCACCAG TTGGAGGGAT 

11161 CTCTCCCCGT GACCGTCAAT GGGAAGCTCG ACCTGAACAG GCTCTCCACA ACTCAAGTCT 

11221 CGCAGCCAGA GCTTTACACC GCTCCACGAA ATTCGACAGA GGAAACCTTG TGCCAGCTTT 

11281 GGGCATCTCT CCTAGGCGTC GACCACTGCG GCATTGACGA CGACCTGTTT GCCCGAGGCG 

11341 GCGACAGCAT CTCCTCTCTC CGACTAGTGG GTGACATCTA CCGCGCGCTA GGACGCAAGG 
TCACCGTCAA G G AC AT C T AC CTCCACCGCA GCGTCCGAGC CCTAAGCGAA AATGTCCTGA 

114 61 CCGACCAGAA GGATAAGGGT ACTCTGCCAG CGTCTCCTCC CCTCCAGCGA GCGGAGCAGG 

11521 GCCAGGTTGA GGGCGACGCA CCGCTTCTCC CCATCCAGGA CTGGTTCCTT TCCAAGCCCC 

11581 TGGATAACCC CGCTTACTGG AACCACTGCT TCACCATTCG AACCGGGGCA CTCTCCGTCG 

11511 AAGGGCTCCG GGGTGCTCTG AAGCTGCTGC AGGAGCGCCA GCACGTGCTG CGTCTGAGAC 

• • :r.:. TGCAACGCCG GGACGAAGGT CGCCATGTTC AGACCTTTGC GCGTGACTGC GCGCAACCTC 
GCTTGACTGT GCTAGACCGA CGAAGCTTCG AGGACGCAGA GGATGTACAG GAGGCTCTCT 
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CACTGTCCCC 


1 R"3fi1 


1 bAl bbAbAA 


bAbbbbbbAb 


7V m r*"« 7\ m <^ 7\ 7\ /~i 

ATCATCAACG 


GCGACACAAT 


CGCCAAGATG 


AAAAAGGGTG 


1 R4 91 


TPPTPPTPBT 
1 001 Ob 1 OA1 


bAAbAbb 1 CO 


C C C c~* frn f*> i-* f 

CGCGGTGGCC 


TTGTGGACAC 


CCGTGCAGTC 


ATCAAGGCCC 


1 R4 R 1 

X 3 ft D X 


I bAAAAbAAA 


btALAl 1 bbb 


GGCGTAGCCC 


TCGACGTGTA 


CGAGGCCGAA 


GGCTCACTGT 


1 R R 4 1 
X 3 3 fl X 


r PP ,r P7\ r~T , i\ r^r^T\ 
Itl Hl,bALbA 




GAG ATT AT CC 


ACGACGATGT 


CCTCATGCGC 


CTCATGACAT 


X 3 DUX 


I bCCCAACGT 


CATCGTCACG 


GGACACCAGG 


CCTTCTTCAC 


CGAAGAGGCG 


CTCGAGGAGA 


15661 


TTGCACAGTG 


CACGCTGCGT 


AACCTGGAGG 


AGTTCATCAA 


GGAGGGAACA 


TGCAAGAACT 


15721 


CGCTGACCAA 


GGAGCCCGAA 


TTGAGGTCCA 


AGGTCCCTGA 


CCCGGTGCGC 


AATGTTTAAA 


15781 


TTGATGTGGA 


TGATTGAATT 


CTATATTCTG 


GTATCTCTGT 


CTATGTACGG 


TCATCTGAAA 


15841 


CTTTTGATGC 


TGGTTAAATG 


GTGAGTCCTG 


CTAACGCCAC 


ACACAAACAC 


ACACACGCAC 


15901 


ACACACAGNG 


CATATTGAGA 


CGAAACT GGG 


GAAAGCTAAG 


TAT CAAT AAA 


AC AC AAAAC G 


15961 


AAATGGACGG 


AAGGATATCT 


CCCGCTCTAG 


TATATAAGGC 


GT AC G AAAAC 


ACCCGTTGTA 


16021 


CAACCGCTTA 


AG 











WO 2004/026902 



13/18 



PCT/EP2003/010289 



Figure 6 (SEQ ID NO 6): A. chrysogenum genomic DMA sequence of an approx. 5.8 kb 
region marked by EcoRV and BamHI and containing the biosynthesis genes cefD1 (position 
2372 to position 180, inverse arrangement) and cefD2 (position 3888 to position 5133). A 
single strand in 5'-to-3' orientation is shown. The particular translation start codons (ATG) 
and the particular translation stop codons (TAG) of the respective coding regions are 
depicted underlined and in bold type. The intron cefD2 and the said cleavage sites are 
depicted underlined. 



1 


GATATCTGAG 


TGGTTGTTCC 


GCGCGTTACG 


GAGTCATAGC 


CGGTTGAGTT 


CGGCCGAATC 


61 


TGCCCTCTAT 


GTTGTGTTTA 


CGCTCTTCAG 


GCTTTATGCG 


GAATTACATG 


TGTTTTGCAA 


121 


GCCACATTTG 


TTTTCTATGT 


TACATCGTTT 


CACGCGTCGA 


TGGGTCCCGT 


TAATAGCAAC 


181 


TAGAGTCTTG 


GCCGAGCCGC 


CGACTGGCCG 


GACAGTAGAT 


CCCAGTCCCG 


CTCGGTAAAG 


241 


GGAACATACT 


TGCTCGCACC 


GGCCGGCAGC 


CAGAAGAAGC 


GATCCTTTTC 


ACTTCCAGGG 


301 


ACCTTGCTGC 


CCATGGAGCG 


CGGGTCGACC 


CCCTCGTCGC 


GGAGTGGCAC 


CTTGTTGTGT 


361 


TTATGGTTAT 


CCGTGCTCAT 


GCCTCCCACC 


GTCTCCCGCA 


CCCTAATGAA 


GACAGGAACG 


421 


GCGTAGGAGG 


GCAACTCTGA 


CCGCAGGAGC 


GACGTCAGCC 


GTGACCAGTC 


CAACGTGTCT 


481 


GGGGTTGCCG 


CGTTCTTCAA 


TGCGATGGCT 


GCGCAGCCGG 


CTCGCCCGTC 


GTGGTTGGGG 


541 


ACCTGGACGC 


CGTAGACATT 


GGCCTCGGCA 


ATGTCTGCGT 


GTGAGCCCAG 


GACTTGTCCT 


601 


ACCTCGGTCG 


TGGACACGTT 


TTCTCCTTTC 


CAGCGGTATG 


TGTCGCCTGG 


TATTCACGTT 


661 


TAGCATTGTC 


ACTCTGCGAT 


TGTAGCAGGG 


ATAACGTACC 


AAGTCGGTCG 


AGAAAGTACC 


721 


AGTGACCGTC 


GGCATCGCGT 


CGAAGTACCA 


GTGACCGTCG 


GCATCGCGTC 


GAAGAGCATC 


1 O 1 

lol 






OCUUC-L 1 L,l C 






TCTTCTGCGT 


841 


CGCCTCCTCA 


GCATGCCAGT 


ACCCAGCCCA 


GGCTGACCGT 


GACGGGAGCC 


GCGCCAGGAT 


901 


CTCACCGCCC 


CTCTCGTACG 


GCAGTCGCTC 


GGCAAAGCCC 


GTCTTCGGCG 


ACCGCCAGAT 


961 


GTCACCCGTC 


TCCGGGTCGA 


TCCTGACGGG 


TACGTAGTCA 


TTGTGAAACT 


TGCGGCGGAG 


1021 


GAGCCAACCG 


TGGTGGCCGA 


CGGCGCCGAG 


GCCGAAACCA 


CCGCCGCGGT 


AGTGTTTTAA 


1081 


GAGTGTCAGG 


ACGCCTTCTG 


TGCTGGCGTA 


GAATTCACCG 


ATGTCCGATA 


CGCCGAAGCG 


1141 


GTCCTGTTTG 


TCGCTTGGTG 


TCAATCCTAT 


GGTTTCGATG 


GGGCTGGACC 


GAAGTAAAGG 


1201 


GG GAT CAT AC 


CTGGAATTTG 


GTCCAGAGTT 


CCGGGCTGAG 


TCCGTTCCCC 


CAGACGAGAC 


1261 


GGACGCGGTG 


CTGACGGTCC 


TTTGGTGAGG 


CCGGAGCAGA 


GAGTAGGTAC 


CGAATAAGTT 


1321 


CCCCGACTGT 


ACACTCGTCA 


GGAGGGCAAA 


CAAGTCGAAG 


AAGGGTACCA 


AGGGAGAGTA 


1381 


GTGGCTTACC 


ATAGACAAAT 


ATTGTTGACC 


CGCTCTCTAT 


GCAATCGTCC 


CAGAAGCGAG 


1441 


ACAAGGAGAA 


CTTGGGCGCA 


AGAGCAATGG 


ATATTCCGCT 


CATCAAGTCG 


TTCATGGCCG 


1501 


CGATGCCCCC 


CGTTCCGTGG 


TAGAGCGGGA 


TGCAGTAGTA 


GGTGCGGTCA 


CCGTTGGGGC 


1561 


CCGGTTTCTG 


CCCAAATGTC 


TTTGGCAGCA 


GTGAAGCAGA 


GGGATAGTTC 


CTGGCCACTG 


1621 


TGATGGGTGC 


GGCCTTTGGC 


AGGCCAGTCG 


TACCACTTTT 


GCCGTGAGAA 


TGTGCAACTG 


1681 


GTGGGTGAAG 


TCCGAGGCTT 


TACTCACCTA 


GTGTACATCA 


AAGCAAACGG 


TAGCAGCACC 


1741 


TTGGTGTCCT 


CGAAGCAATC 


TACCGGTGCC 


CTGTGTGTTT 


CTTTCCGGGC 


AATGTCTTCC 


1801 


TTCAGCGTGC 


CGGAGAGCAT 


GATAGCCTCG 


ACATTGATAT 


CCCGAAGCCG 


CTCACCAACT 
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Xo OX 


TP^Tna atpp 

1 uva 1 variri 1 


GAGAGGAACA ATCGGACGCA 


TPATCGTATA 

x un x wvj x n x / 


TPAGAAAPPG 


PGAGPPGGAP 


1 QOI 




AGTGAACCAG 


AGCATCAGAG 


PPGAGATTPT 


AATTPATPAG 

r\t\ X X OjM X 


PPPAPPAPPA 


1 Qftl 


PPAPPPATAG 


ACAGTAGACC 


CATCCAGATG 


AACATCAGCT 


PPGGPGAGTT 


GTACAGATAA 




ACACCGACGT 


GCTGCCCGGC 


CACCACGCCC 


AGATCGCGGA 


AGTAGTGGCC 


ATACTGACAT 




PCGCGTTGGT 


ACGTCTGCGT 


CCACGAGTAC 


TCGGGGTGCC 


CGCGCGACCA 


GATGCAAGGT 


91 61 


GCATCGCCGA 


GGCGGGCAGC 


GGCGGCCTCG 


AAGAGGAAGA 


AGCCAGATGC 


CTTGCGCTGC 




TPAACAGCTC 


TGGCGAAGTT 


CTGGGCCCCC 


CGTTCGGCGC 


GGGCGAGCTG 

Vjvj wvvnww Ji vj 


GTTGAGGTCC 


99ft 1 


TTGGTGAGGT 

1 A V3V3 X \3^\\3\3 X 


GAAGCTTGGC 


GTCGAGGTAG 


GCAGCGGCGG 


PGGTGPTGGC 

V>\jVj X Uvy X V3W 


GGPAGPGGCG 

uvjvnuv/VjVjvVj 


9 ^ A 1 


PPAPPPAPPP 


TTAGTAGGCC GCCGGGTGCC ATTGGGGATT 


PPPGTTATPP 


TPPPPTPTGT 


z 4 U X 


lab 1 1 la x Va 1 U 1 


TTGCGAGCGA 


TGTCCTTGCC 


TCCGAGACAT 


TPPTPPTPPT 
1 bb x bb 1 VaVa 1 


bab. 1 obj 1 ovab* 1 




rrirp«^7\ f~» rp f~> rp /— i rp 

J. 1 la Ab 


CGGAGAAGGG 


TGTAAGTGAT 


CGTGCGATCG 


bbJAb 1 \^\^Jt\f\t\ 


apTTpappaa 

A\a 1 1 VaAlwb-AA 


Z OZX 


apappr*apap 

Ao Ab* la orAb^JrlU 


AAPCAGTATT 

nn^vnu J. x x 


GACGAATAGT 
vjfiv^vjnrv x **o x 


TGTTGAAGAC 


r T , (ZC r VHCZC*C ,T VC 
X x riVav^V-, 1 \-» 


pppapppaPA 

bbbribbbnbn 


otpi 

6JOl 


appTTT^ppap 


ACCTGATGGT 
jtaVw. v x ojn x uvj x 


GTGGCGGGTC 


TGTGGPTTTG 
x va x vavava x x x %a 


f^PPPPTTTTP 
bbbbb 1 111b 


PP A PPPPPPP 
bbHbbbbbbb 


Z o^t X 


a a p p p a p a t 1 p 


GATGGAGGGA 


GATCGGCGGC 


GATGTTGCGG 
\jr\ x va x x vjvuu 


LvbUnVy 1 £±£\\^ 


A APAPPTPTP 
ririvairiVao lulu 


o "7 n i 


r*PTaPPTPPP 
1 Abo 1 bbo 


GAPTPPPTAP 


GPTAPTPTAP 

O O X jrl\_, 1 O A jtXU 


PGPPPPPGTG 


UnHLLnouvn 


PTPAaPaTTP 
u 1 br/irioAl 1 b» 


O *7 £ 1 
Z / Ol 


a aPPaPTPTT 
AAoori.b, loll 


CPTGTPPPAT 


ATGPPPPPPP 


PP ATPPGTPT 
u o/a x va\-#vj x \a x 


pp a pptt a f a 


PTPTPPaPT A 
bl bl b^laAb 1 A 


Z O Z 1 


PaTAPaAaPA 


TTPTAPPTTA 

X X O X nV^Vrf 1 In 


PPTAAAAPAA 


PATPPAPAPT 


a aTapppTpa 

Ari 1 rib* o\>> 1 b>ri 


PTPTTPPPTa 
blbl 1 lalab 1 A 


OQQ1 
Z O O X 


r*fZfZfZ A T P A T A 
Ububn 1 la/iXri 


TTPAPTPPPP 


AAGAPPGAPG 


PPAAPPAPPP 


paTapTPTap 


ppaTappp'ra 

b»b» A 1 Alalab 1 A 


OQ/1 1 


ppTPPTPPa a 

bo 1 00 JL bb/i/i 


APAPPGTGGP 


PAAPTPATPP 


PTTPPTAPPP 


r rapaar , ar , r , p 


obAbAbJobbla 


JUUl 


t r* a fine p rifz p 

1 oAooobrOlao 


TPGPPTPPPP 


GGGPAGAAGT 


PPTTAPAPPT 


1 HAbjAvabv I Ala 


1 bla I 1 AAbbj 1 


n £ i 


1 Abn 1 la 1 1 


AAGCATTGGT 


CTGGATCGAA 


TCAGATGCTG 


1 bbbl 1 laAO 1 


laoA 1 bb>Alabb7 


jl^J. 


bbbbbbbbbb 


CTGCGTCTTA 


ATATAGCAAG 


TACCCGTGAC 


1 A 1 la 1 AAla i 1 


la 1 Ab A 1 la i Ab 


^1 ftl 


nlbibl 1 la 


PPAPAGPAPP 


GPPTATA ATP 


TAPAPPPAPP 


1 IbbAbbnAb 


All AbAI i laA 


•30/I 1 


iAbl 1 larval 1 


TPTPPPTAPA 


TPAPAGPPPP 
l bnbAbbbbb 


PTPATATT AP 


Ala 1 AbA 1 tab. 1 


bbGbbAAb 1 A 




A 1 Ab. AAo Ab, A 


PPPAPPATPP 


PAPPPA A APA 


TTTPTS PTPP 
1 1 ibl jrib^ JL bb 


AlaAA 1 (aA i tab, 


blbAl 1 iAtaCa 




bobAbAo Ibi 


GCATACATGC 


CATGCCATAA 


TGAGGTCGTG 


TA1 CCGTAGT 


PTPrnpprpnmp 

CTGTGCTCTG 


O/OI 

jfl^l 


oA 1 b/\b 1 A 1 la 


AATTATGTCT 


GAAGTTATTA 


CTTGGCAACA 


Abb 1 lab> 1 1 (abx 


/*ip7\ 7\prppppp 

bbAAb X biGlab 


^4 ft 1 

O *i O X 


n.bbl 1 x v^/i.1 1 


CATGATGGGA 


GGATGGAGAC 


GATGAGTCTA 


ibbl 1 bbAbA 


PTIPPP7\ T , P^P , P' 

b 1 blabAl (abb 




rprrnTprTT 

1 


GGTGCAGCTA 


GATTTTCAGC 


TTCGATGCAC 


AlaCab 1 bbbbb 


b 1 I CaAl Abi A 


3601 


CAGCAGAGTA 


CAT G C C GAGA 


CAAGAATCTT 


CAACGTCCCC 


GATGCGCTTT 


TGATATCCAC 


3661 


CCATGTTTTC 


ATAGTCGCTG 


CGCGGTATCG 


GACCCGAGTC 


GTCTGTACAC 


GTGGGTAGAA 


^7 91 


PTTAPPPPA A 
bl l jriuoUOjri/-\ 


TGCCACCTCA 


CCACGCTGCA 


CCATCCATGC 


boAolabAtalab 


1 I bCaAbAl 1 la 


"37 ft 1 
J / Ol 


p A APTaPPP A 

LAAb 1 /-*b.oori 


CTACAATAAC 


ATCAAATGGC 


ATGGTGGGAT 


1 1 bblabllaAb 


bAbb 1 tabbbA 


A 1 
O O H X 


b*A 1 IrtbnibJ. 


TGTAGTCTTG 


ACCATTCACG 


CGGTTAATCC 


bAbb 1 blaAlG 


bjAbbbblbl b 


3901 


GCCCACATCC 


GCTATCCGGC 


AAGCTTGTCG 


TAGAGCTCGC 


GGGGCTAGCC 


CCAGGTCCGT 


3961 


CATCCATAGC 


CACTTTCATG 


TTGCTCTGCC 


GAGAGGCCTA 


CGCATAGAAG 


CCGCTTCTTG 


4021 


CGCTCACGCA 


CTAACTCGCG 


CACTCCAGGC 


CCATTCTGTG 


GCATGCTCTT 


GGCAGACTAT 


4081 


GGCGCCTCAG 


TACTCCGCAT 


CGACGGACCG 


CGATCCCCAA 


AGGGGGACGT 


CCTGGCGAGG 


A "1 A 1 
i J. *a J. 


AACAAGTCGT 


CCATCTGCAT 


CGACTTGAAG 


CATCCGCCCT 


CACGCAAGGT 


GCTCCTCTCC 


4201 


ATCCTGTCCC 


GCGCGGACGT 


GCTCATCGAC 


CCGTTCCGGC 


CCGGCGTCCT 


GGAGCGTCTG 


4261 


GGGCTCTCCC 


CCACAGAGGT 


CCTTCTCAAG 


GCGAATGCCC 


GCCTGGTGGT 


CGCCCGTCTC 
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4321 


T\ p"« /— • p» C* mrri/~i /—» 

ACCGGCTTCC 


bbbGAbATbb 


/~'T\7\P" ,r PT\P , P" , TVP* 

bAAb 1 AC LAG 


bAbAl bbLAb 


r* r* r* 7a t /™* 7A t 7a t 1 
bbbAibAlAl 


r* a a r* m a r* r* t r 1 

bArlb JL AV»»V^ 1 V^ 


4381 


GCCGTGTCTG 


bbGl LL fGbt 


rp tv mp"»p»rnrpp»/~»m 

lAlbbl lbbi 


AbbbbAbbbb 


tv p^ TV flTPPPTT 

AbAAlbbbl 1 


bbbbbbbbbb 


4441 


7\ 7\ P" , T\rpp^/-«rp/-'p^ 

AACA1 CClCb 


p^p^p^Tvp^mmrpp'p* 
bbbAbl 1 I bb 


bbbAbbbbbb 


r* r* a t ^ t ^2 r" r* 


1 bblbbbAAl 


Jlbl v-3v^ X bbbb 


4501 


/-i rp /~»/~" rp 7\ l■^P , p■"P >, 

CTCGTATCGC 


P"»OP" TV rpP»P»p«TV pi 

bCGA JTbCCAb 


P ,, /^P , P , P , T' r PP , P*P > 

Gbbbbi lbbb 


r* 7i r* o t r* *p C r* 
bAbb 1 bb i bb 


AbbbbAAbAi 


bb 1 bbnbbbb 


4561 


TCI GCGTACb 


fTPPPP7\ P* P" TV HP 

1 bbCCACGAl 


P'/"»P"»p»p > i/-«p»P*rnpi 

bbbbbbbb 1 G 


b>C bnb L. AAGA 


bbbbbl Iblb 


bbb X 1 LbLbb 


4 621 


CGGGGCGAGA 


Tv m /~" rri /""• /~ ' rn p* p* 7v 

ATGTCCTGGA 


/*«/^/*^7VP , /~ , P ,, rTip , p , » 

CGGAGGGTGC 


/^» r *^/-^mp , »p*»rp TV rp p^ 

GGTAIG 


P^P" A P* 1 A T 1 7\ r*i"*f^ 
C bAb A 1 Ab bb 


PBP1J\3\PPBP 
b Ab/iAiib b Ab 


4 681 


CCCGGCGGGA 


TV /*"» rp TV PTVfppPP 

Abl AbAl bbb 


bb 1 bbbAbbb 


b. 1 GbjAvaCU 1 C 


tv p rp m rp tv (-> tv 

pvb 1 1 b 1 AbbA 


PP-TP.PTP,P.Tr. 
bb 1 bb 1 bb 1 b 


A ^ A T 

47 41 


CbAbbrbl bb 


bbb I bbAbAA 


bALbbAtL 1 b 


bL 1 L,L»b»Uob»G 


AbbAlAbbbb 


b/irt.1 1 bbbbb 


a o n i 
4oUl 


AbAb I bAbbb 


bbblAl ityOA 


bbb/iAAHl A 1 


r* cz 7A r* a ^: a a 


bbbbbnbbbn 


b 1 bubbbbnb 


4 o bl 


b I b 1 1 1 bAbb 


13VjAL.oV.dA1 


/-« rp P> f-» P> rp /*"• Tv pi 

C 1 bbb 1 bAbU 


bbbb 1 bb 1 bb 


AbbAAbbl bA 


pp pp rjpi a rza a c 

bb 1 bbftbflhb 


4 921 


GbbbbbTl bb 


TV tv /~» 7V 7v P^ 1 P^ 1 P'O T 1 

AAbAAbbbb 1 


i LbbblbAAI 


1 IbbbbbbbA 


bbbbbbbAAn 


PPPTfl n">rp P*rp rp 
bbb 1 Al X b 1 1 


it Pi Q 1 

4 981 


/—» f~* P" TV P" TV P< p» 

CCCGGACAGG 


prppii TTPP7VP 

b i GA 1 1 bbAL 


GGGGCGGACC 


/-i rprpp*p-« r-i 7\ TV p>p-» 

b 1 1 bbbAAbb 


bbbAX bbAbb 


apappiiPBTP 

HbnbbHbAi b 


5041 


r>rpp""P'<p , p" , p , p*p , rp 

CTGCGCCGGT 


ppivyuPPPTP 
bbAI 1 bbbl b 


p*p , 7\a TyrrrTr 1 
GGAAAGGGGG 


b 1 1 bAb 1 Abb 


Abb 1 1 bAbbA 


bAA i AbbbbA 


5101 


ATTCTCGTCG 


pmmprnmppPP 

CTTGTTCGCG 


x~> 7\ tv TV 7v /~*rp rn /~* 

GGAAAAGTTG 


mTvpt/^P'TV P , P , P , 7v 

TAGbbAb G b A 


r^fT^TK P'P'P'P'TV T 1 

GbbAbbbbAI 


PPTPPTi^ p« TV rp 

bblbbl AbAl 


5161 


y-i >7» m /"1y""v Tl 7\ m y^""* 

GCATGCAATG 


TTGGCCGCTT 


ATGTACGTAT 


ptnPPTV m 7v p* Tv rp 

GTGCATACAT 


7\ TV TVPTVPfTPTVP 

AAAC A 1 I GAC 


tv tv m tv pfrpprnp 
AAIAbibbi b 


5221 


TCATGAaGGA 


P"» TV ^ <"< f~> P"» P> 

GGAGGGGGGG 


GGTGGGTTTC 


GGCCCCT G AC 


/■~» p«rp rp rp TV 

bbTGbb 1 1 GA 


TIPPPPTV /^7\fPO 

1 bbbbAbA I b 


5281 


GACGCCGCAT 


CGTCAGCGGA 


GTCAAGCCTC 


P^P^P" 1 7V P 1 * TV P* TV P^ O 

CCGACAGACC 


rp /"» <^ /~> P* 7V PP P^ 

TGCCGACCCG 


TV P5\TIPPP7\ f~* m 

ACA1 CCbAbl 


5341 


ATCTGTACGT 


tv tv y~« tv mm/"*/-* tv m 

AAGATTGCAT 


tv /*^r*^ tv tv r~> tv tv rn 

ACCAACAATG 


m tv r™» tv /^/^m tv /"i m 

TACACCTACT 


rn /™»r~» m tv f~*f~ > /~* mm 

TCCTACGGTT 


p»p*i A P**P^ 7v mmmrn 

CCAGGA1 111 


5401 


CTCTCTGGAG 


GTTGCATGGA 


/~»/^»/~« TV TV TV m Tl TV 

GCGAAATGAA 


TV TV TV TV TV TV 7V m /"» 

AAAAGAAATG 


CTGCTGCCTG 


TGTTGGAATT 


5461 


r^ tv /■•« tv o^i/im my 

CACAGCCTTG 


j^i i\ m /*■•/■"»/■*• tv r~»r"i 

GGATGCGACC 


TCCCTTTTCA 


r»nr»fnommrin m 

CCGTCTTCAT 


CCTCGTTGGG 


GACTCTCAAC 


5521 


CATGCTGCTG 


TCGCACTGAG 


ATAATACAAG 


GCGTAATTAC 


TGCAGCGGTA 


CGTCGTAATT 


5581 


GGACTTACTT 


TTGTACGACA 


GTTGATGTCC 


AGCGGCATAA 


AAAGCCT C AG 


CCGCCAAGAC 


5641 


TGGCAGACTT 


CTGCAGCCCT 


ATCTTGATAT 


GATCACCCCA 


TAGGCCGAGG 


CCCTGTGCTC 


5701 


GAATCCCGCA 


CGAAGCCGGA 


TTCATGTGTA 


TTCCCAAGGG 


GGTGAGGACG 


GAACTCTTAT 


5761 


TTCGACCTCC 


GGGGGGCCGA 


GTTCTAGTCC 


GCTAACCTTC 


ACGGCTACAC 


CGTCCCTCGC 


5821 


GTCTCAACTA 


GCCATATAAG 


TCCTAGGTAA 


AGAGGTTAAA 


GTAGGTAGGA 


AAGGAACTTG 


5881 


TGGCTTGGCG 


GATCC 
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Figure 7 (SEQ ID NO 7): A. chrysogenum genomic DNA sequence of an approx. 4.6 kb 
region marked by Xbal and SgrAI and containing the biosynthesis genes cefEF (position 
1118 to position 122, inverse arrangement) and cefG (position 2058 to position 3534). A 
single strand in 5'-to-3' orientation is shown. The particular translation start codons (ATG) 
and the particular translation stop codons (TAG, TGA) of the respective coding regions are 
depicted underlined and in bold type. The two introns in cefG and the said cleavage sites are 
depicted underlined. 



1 


TCTAGATACC 


TTGAAACTTG 


AAACTCTTGA 


CACTCATGCT 


TTGTGTTTTA 


AAAATTTAGC 


61 


AATTTATAGG 


GCAATTTTTC 


TTCTTAAACT 


CCCGTAGATT 


TATTACTCGA 


TCGGCGGTTC 


121 


CCTAAGTGGC 


TATAGGAGCT 


GCGGTAGAGA 


CAGGGGCAGC 


CGCGGGGACA 


GCCGCCTCCG 


181 


CTGCCGCCGG 


CTTATCCCTC 


CGCATGTTGA 


CATAGTTCCC 


GCCAAGCCAC 


TCCCTGAACG 


241 


TCGTGCGCTC 


CGACGGGATG 


CGGACGTTGA 


AACCCCACTC 


CCTCGACTGC 


TGCACGTTGA 


301 


AGCTGAAGTC 


GGGCTTCGGC 


CGCAGGAAGA 


AGACGCTCGA 


CGTGCGGCTG 


CTGCCGACGC 


361 


GCTGGTCGCG 


CCCGGGAGAC 


TTGACCCGGT 


GCTTGGGCGC 


CTTGACCTTG 


CCGCCCGTGG 


421 


CCAGGGTGCC 


GACCGCGCCG 


CAGAAGACGA 


CCATGGCGCC 


GGGGAGCGTC 


GGGAGGTCGA 


481 


CGAATTCTCC 


GTCCACCTCG 


CACTGCAGGC 


TCACGAAGCC 


GTTGGCGCAG 


GCTGTCTGGT 


541 


GCACGAGCGT 


GATGGTCGAT 


AGGTCGTAGT 


GGGGTCCCAT 


GCGGAGGGGT 


TCCTCTTCGG 


601 


CGACGCGGTC 


CTCCGGCACT 


TCGGGGAAGT 


ACCGTAGGCG 


GAGGAGGGGA 


TCGCACTCGA 


661 


CGAAGTCATC 


AATGTCCTCC 


CCGGCGAGCG 


GGGCGCCCAC 


AGAGTTGAGA 


ACGGCGCGCG 


721 


CGACATCCTT 


GGCTGCGCCG 


TACATGCGGT 


CGAAGTAGTC 


CTGCCAGACG 


TCCTCGAAGC 


781 


CCCGGTTCGG 


GAACAGGTTG 


CCGCCGATGC 


CCATGGAGTA 


GCACGTCGAG 


TAGTCCGAGT 


841 


ACTTGCCCGT 


CTCGGTGACG 


ACGGCGGTGC 


TCTCCCACTC 


GAGGGCAGAG 


AAGCCGCGGC 


901 


GGGCGTTACG 


GTCGGCGAGC 


GTCACGGCCC 


TCTTCTCCTC 


CTCGCTTCCG 


TTCTTGAAAA 


961 


AGTCAACGCA 


CGTCTCACGC 


GCCGAGGTGT 


GGTCGTCGTC 


GACCAGGCCG 


CTCTCGGTCA 


1021 


AGTAGAAGAT 


ACCCTTGGTG 


GTGACGGCCT 


CGGCGAGCTC 


GGTGAGGACC 


TTGCCGCTCT 


1081 


TGAGGTCGTC 


GAG AC G AAAG 


ACGGGGACCT 


TGGAAGTCAT 


GTTGATGCTG 


TGGTTTTGAG 


1141 


CGATGACTTG 


AGAGGAGTAG 


CGTGGAGGAA 


AGTTCTGCAA 


GAGGAATTTA 


AGGATTCACA 


1201 


AGATCCCAGT 


G AG AAC G AAA 


CGTTGTCAAA 


GCGGTATATA 


TATATCTCAA 


ACCCCACCTC 


1261 


GTAGCTTACG 


CCGAGGAACT 


CCTTTTTAGA 


CAACTGCTAC 


TTAGCCGTAA 


GTGACGGCCT 


1321 


GCTTCCCCTC 


AGCCTTGGCC 


GCACACGTCA 


ATGTAGCATT 


GTAAACCCAC 


GAGTGTCTTG 


7-81 


TGAAGTTTTG 


TCAACGAATC 


AT AAGAAGC C 


ATCGAGTTCT 


CTTCTCGTTC 


TTGGTTCGCA 


j. 4 4 1 


GGAGAAT AT G 


TATCGTGCAT 


GGTCCCTGAT 


CGTCGAGACC 


GCCATGGAAT 


CGTGCAAGCC 


1501 


TTAATTCTCC 


GTACAAGCTT 


CCCCATTCGG 


ACAAGATTGC 


GATGATGTGG 


ATGCGGGCTC 


1561 


TTTTAATAAG 


GACCTTCTTA 


ACCGATGGTC 


CGAGAGTGCC 


TAGGACGGGT 


CCATGTGCAT 


1621 


ACACGACGGA 


CCCTCGACCT 


CCTATTAGGA 


GCATGAGGGA 


CGACAAAATG 


CGAACGACGA 


: c x : 


TGCATCAAAA 


TGCACTGCAA 


CGTCGAGTTG 


TGGGCTACTC 


GCCTTCTGAT 


TCGCAAGCCC 




TCGGCGAGTC 


CACCTACTAG 


TAGCTTGGGA 


AT AAAC AG C A 


AGTTTCGCCG 


CCAAAAGGGC 




TGCCCGGCAT 


CCGATTCGAT 


GCCATTGTAC 


ATCAAGTCGG 


AAATGGTGCT 


CCGTTTCCCC 
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1861 

1921 

1981 

2041 

2101 

2161 

2221 

2281 

2341 

2401 

2461 

2521 

2581 

2641 

2701 

2761 

2821 

2881 

2941 

3001 

3061 

3121 

3181 

3241 

3301 

3361 

3421 

3481 

3541 

3601 

3661 

3721 

3781 

3841 

3901 

3961 

4021 

4081 

4141 

4201 

4261 



CTGGGGTGAG 
TTATCGGATG 
TCGGAGAGTT 
CTTGTCGTGC 
TTCCCCCGAG 
GTCATAACCT 
ACACGTCGCC 
AAGA&ATAGC 
CCGTGGCATA 
GCCACACCTT 
GCAGGGCTTT 
TTGGGAGTGC 
CCAAGTTTCC 
TTGTCTCAAT 



AGGGCGAAGG 
TTGCTTCGAT 
ACTATTCCGG 
AT G AT AG ATG 
TCTCTCCCCG 
ACCTATATTC 
CACCATGTCG 
CAGAATATCG 
CAAATCGTGG 
GACGAGCAGC 
CGATACCTCT 
TGGACCATGT 
TCGCACGACG 
ATCGAGTGGT 



AGTAGTTCGA 
CCGATCGTAT 
GCTTATGCGG 
CTGCCGTCGG 
ATCCCGCACG 
TCATCCCGGC 
CCTCAGATCG 
CTCTTCACAC 
GGTCGCATGA 
GCCCATGTCA 
CGCTACTTCA 
TCACCGGACC 
ATTCGAGATG 
CAGGACAATC 



CCAGTCGCAG 
CCCGCGCGGC 
ACGGGCCGCC 
CCCAAGTGGC 
GGGCCGTCAC 
AAATGCTGCG 
CCAATCGCTT 
TGGAATCTGG 
ATGTCTCAAG 
CCTCGTGGTG 
TCATCTGCCT 
CCGATGCAGA 
ATGTTCGGTA 



CGCACCCAGA 
CTAGATCTTG 
GCCGTCGATG 
CCGTCTAAAG 
TTTCGCTGCC 
GGATAGCCTC 
CGAGGCTTCG 
CGTCATCCTT 
GGATAACTGC 
GCCCACACTG 
AAATTATCTC 
AGGCCAGCGC 
GGTAAGCGCA 



GCCGCAGGTT 
CTAATACGAG 
CCGGCCAAGG 
CCGGACCCCT 
CTCGCTCCTT 
ACCTACAGCC 
CTAGATGCCC 
CGCGATGTAC 
GTCATCGTCT 
TTTGGCCAAG 
GGGAGCCCCT 
CCGTACGGGG 
CCGATCCAGC 



CATCGCCAGG 
ATGGGTGGAA 
GTGCCCATCG 
AGGCAGTGCA 
CCTGTCCGGG 
GCGATGGACG 
TTGCCGTCGG 



TGCTCGACAG 
TGCACACTCT 
CGACATCATG 
TCTATGATGA 
GGCTCGAAAC 
AGCGCTTCCA 
TCGATGCTCA 



GTTAGGCGTC 
GGAATGGGCC 
CCGTCAGAGC 
CCCCAAGTAC 
AGCGCGCAAG 
TATGGCTCCA 
GAGCTAATCA 



CAGGCTAAGC TTTCCGTGTC CAAAAG TATT 
CTGCCGTAGT CGGCGCATCC 
CCGAGTACGT GCGAAAGATT 
CAGCTTGGTT CGAGACACAG 
AGTACGACGT AGACGACCAG 
TCACGTACAA GAGCAAACCT 
CCGGTGAGTT TATAGATGCC 



AGGCAAATTG 
TTCTTTGGTC 
GGCTGGTGCG 
CTGGACGGGG 
ATTGCGAATC 
GGAGTCCAAG 
GACCGAACCC 



GCAGCCAGGA 
GCCAGCCCAT 
GCTTCGACGC 
GAGGCCGGGC 
TTTGCGCCAG 
TCCCAAACAG 
AAGCGGACAA 
ATGGAGGTGT 
GTCTCCATGT 
TCGCAGTAGG 
TTCATAAAGC 
AGGGTCGACT 
TTGACCCATC 
AAACATGCGT 
AGCGGAGGCA 
CGTAGTACCT 
TATCGCCATG 
TGATTCAAGC 
ACGCGTCTCG 
TGAACCTGGG 



GCTGCTAGGC 



TGCGAAGAAG 
TGAAGCCGTA 
CAACTGCTAC 
AGGATCAATC 
GTCAGACGGT 
TCGTCTTTGC 
GGTTAATGAT 
CAGCCTGCCG 
GAGTATGGAT 
TATCAGAACA 
G T C AAAT AAA 
CGGCAGATTC 
CAACTCGGCC 
GGCTCAACTC 
GGCGGGTATC 
ACTCGTATAG 
CTGCTGCGAA 
AGGTTGTCGT 
CTGTCGCGCC 
CGGGAGTGAT 



GAAATCAACG 
TCTTCCTATC 
ATCGCCATGA 
CCGGAGGCTC 
CTGTACTCGT 
GTGGTGGACA 
GCCGTCAGAG 
GTGCGCGTAC 
TTCGCTGTTT 
GCAGCTGAGG 
GCCCAAAGTC 
GAGGCATTGT 
AGAGGCAGCA 
AGCAAGCTCA 
GTAGTAACAC 
AGGCACCGCG 
CACTAACAAA 
GGTCCAGGTT 
TTCCACGACC 
GGAAGGTGCC 



GCACAGACAG 
TCCGGTACCA 
CACTCAAGTT 
TGGCAATGAT 
TTGACGAGCA 
CGAATGAGGG 
GATTCCTCGA 
TTGCCAGGGT 
CCGCTCGGAT 
CCTTCTCGGA 
GCAGTAGACT 
AGCACATTGT 
GGCAAAGCAT 
TTGCCAACGA 
CGTCCACATA 
TCAGGAGAGG 
TGATAAACAA 
TGGTGCCCGA 
CAGAAGTTGA 
TACCCTGTAC 



CGGCAACAGC 

GGCCCAGAAG 

CGACACCCAC 

TACACAACCA 

CGTTGAGATG 

TCATGACTTC 

TCAGTCATTA 

GATCGATGTA 

ATAGGCACTC 

AAGTAGGTTG 

CATCGCATCG 

CGAGGCATTG 

CTCAGCGTAG 

GGTCAAAGAA 

ACACGGGCTC 

TATCAGAACC 

GGGCCCATGC 

GCCCCGCACA 

TGTGCAGAAT 

AACCAACTAC 



CGGAATATCA 

CACCGTGCTG 

TTTGCCGCGA 

GACATCAGCA 

GCGTTGATCA 

GGGCGCAGTA 

TTTGTAATGG 

ATGTGAGGCT 

CTCTCAGATA 

TCAGGCCATC 

TGTCAATAGA 

CAAGTCTCAG 

AGGCGGAGAC 

GCTCCATGCA 

AATAGAAGGT 

AGCGGAGCAA 

CTCATGATTC 

TGTGTGATGA 

GCTGAAGATG 

GGGCAGTGAG 

GTCGGTACTC 
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4321 GTAGGAGCAA TAGCGATGAA GCGTCGGGAG AGAAGTGTGA ATTACTCTGG TACCTGGTAC 

4381 TTGATGCAAC ATAGCACATT TCACCCATCA AAGCTAGGTC CCGCGGCCTG GGAGTGGAAT 

44 41 GGTGAAAGAC ACCGAGGCAA ATGCGGCATG AATGAGGAAG CACGGACGAG TCGTGGTTTC 
4501 ACAAGAGACA CTCTGACCGA CCACAAGATT CGGCAGTACA GTCACAGCAT CACCATCGGC 

45 61 AGTCAGACAT GATTCAGAGC CAGGTCTTCG GCAGAGGGAA TTAGATACAC CTCG GCACCG 
4621 GCG 



